Invariant matchings of exponential tail on coin flips in Z 

by Adam Timar 



Abstract. Consider Bernoulli(l/2) percolation on Z d , and define a perfect 
matching between open and closed vertices in a way that is a deterministic 
equivariant function of the configuration. We want to find such matching rules 
that make the probability that the pair of the origin is at distance greater than 
r decay as fast as possible. For two dimensions, we give a matching of decay 
cr 1 , which is optimal. For dimension at least 3 we give a matching rule that 
has an exponential tail. This substantially improves previous bounds. The 
construction has two major parts: first we define a sequence of coarser and 
coarser partitions of Z d in an equivariant way, such that with high probability 
the cell of a fixed point is like a cube, and the labels in it are i.i.d. Then 
we define a matching for a fixed finite cell, which stabilizes as we repeatedly 
apply it for the cells of the consecutive partitions. Our methods also work in 
the case when one wants to match points of two Poisson processes, and they 
may be applied to allocation questions. 



§1. Introduction. 

Fix (fi, E, P), where O = {0, 1} Z , E is the product a-algebra, and P is the product 
of Bernoulli measures with parameter 1/2. We prove the following theorems. 

Theorem 1.1. For d = 1,2, there exists a deterministic perfect matching <pu, = (j) between 
{x G Z d : u(x) = 0} and {x G 7L d : u(x) = 1}, such that for almost every u G O , <fi is an 
equivariant function (i.e., tp^ = (j> g (w) f or every translation g ofZ d ), and for any r > 0, 

c 



P;dist {o.o(o) ) >r]< il/2 



with some constant c. 



Theorem 1.2. Consider d > 3, and e > arbitrary. Then there exists a deterministic 
perfect matching W = <fi between {x G 7L d : u(x) = 0} and {x G 7L d : u(x) = 1}, such 
that for almost every ui G Q, (p is an equivariant function, and for any r > 0, 



P[dist(o,0(o)) > r] < Cexp(-cr d - 2 - e ). 
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The bound in Theorem 1.2 can be slightly tightened, see Remark 3.30. We also have 
some ideas that could possibly remove the "— e" from the bound, but at the cost of much 
extra complication. However, the correct magnitude of the exponent is not known: the 
only lower bound is the trivial C exp (-cr d ). 

For d = 2 Theorem 1.1 is new; the best known result has been that of [S]. Note that 
for cZ = 1,2, Theorem 1.1 is essentially tight by a theorem of [HP2], which says that for 
any matching rule 0, E[dist(o, (p(o)) d / 2 ] = oo for these dimensions. For higher dimensions 
it was believed by Holroyd and Peres that there would be an exponential bound (see [HP2] 
and also [S]). 

Our proofs rely on the following theorem, which is of independent interest. Infor- 
mally, it claims that there is a sequence of coarser and coarser partitions for the space 
that are deterministic functions of the point configuration, and still most of the cells are 
(approximate) cubes with i.i.d. Bernoulli labels in them. 

A subset X of Z d (R d ) is called a /c-pseudocube, if X contains some [k/2] x . . . x [k/2] 
cube, and X is contained in some 2k x ... x 2k cube (these two are referred to as the 
volume condition), and finally, if \dX\ < cq\X\^tt , with cq = 2 2d + 2 d d (referred to as the 
isoperimetry condition). This choice for Co is rather arbitrary, and any greater constant 
could be used; in particular, it is clear that the intersection of Z d with a pseudocube of R d 
is a pseudocube itself, with a c that is only worse by some constant factor. 

Theorem 1.3. Fix o e 7L d . There exists a sequence {Qi} of coarser and coarser partitions 
of Z d that are equivariant functions of the configuration to e Q, and such that there is an 
event Ai with P[Ai] > 1 — c2~ 2 , such that conditioned on Ai, the following hold: 

(i) For each j > i, the cell Cj of o in Qj is a 2 J -pseudocube; 

(ii) conditioned further on the location of Cj, the labels in Cj are i.i.d. Bernoulli (1/2). 
(Hi) There is some infinite subsequence Q a (k) such that if i = a(k) then Ci is a 2 % by 2 l 

cube. 

In fact the probability of A, with the above properties can be made arbitrarily large 
by an appropriate choice of the parameters in our construction. 
Theorem 1.1 is a straightforward corollary of Theorem 1.3. 

Proof of Theorem 1.1. Given the sequence of partitions, the matching is defined similarly 
to [S]. Namely, consider the sequence Qi from Theorem 1.3, and as i = 1, 2, . . ., for each 
2 l -pseudocube, match as many yet unmatched points as possible, each with a point of 
opposite label, but otherwise arbitrarily. The central limit theorem gives the claim. I 

A question similar to the ones above is when one considers a Poisson point process 
(the natural generalization of a set of uniformly distributed points in the unit cube to an 
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infinite domain), and colors each of the configuration points independently red or blue 
with probability 1/2. This is the same as taking two independent Poisson point processes 
of the same intensity. Our goal in this setting again is to give an "optimal" perfect 
matching between the red and blue points, by some matching rule that is a deterministic 
and equivariant function of the random point set. (Informally, the matching rule is defined 
using the locations and colors of the configuration points, but no background information 
from the underlying space.) By an optimal matching we meant in the Z d case that the 
function F(r) := P [distance of from its pair is > r] decays as fast as possible. For the 
Poisson case, similarly, we condition on that is a configuration point, and want to make 
the tail F(r) := P [distance of from its pair is > r | G uj] tend to fast. The setting 
of our question shows that requiring the matching rule to be invariant is natural, since it 
essentially means that F(r) does not change if we replace by any other point. 

We will phrase and prove our theorems for the Z d case, but our methods can be easily 
adjusted to the matching problem for Poisson point processes. Furthermore, a sequence 
of partitions as in Theorem 1.3 can be obtained as an equivariant function of one Poisson 
point process, by the natural modifications if our proof. Because of possible applications, 
let us state this separately. 

Theorem 1.4. Fix o e M. d . There exists a sequence {Qi} of coarser and coarser partitions 
of R d that are equivariant functions of the configuration uj of a Poisson point process on 
]R d , and such that there is an event with P[Aj] > 1 — c2~ 2 , such that conditioned on 
Ai, the following hold: 

(i) For each j > i, the cell Cj of o in Qj is a polyhedron and a 2 J -pseudocube; 

(ii) for any Lebesgue measurable A C P[\w H A n Cj| | Cj] = Poisson(|A n Ci\). 

(Hi) There is some infinite subsequence Q a (k) such that if i = a(k) then Ci is a 2 l by 2 l 
cube. 

Here Poisson(A) denotes the Poisson distribution of intensity A. 

The question that we address was first asked by Holroyd and Peres in [HP2]. The 
best results were of order r~ s ( d \ where s(2) < 2/3, and s(d) < 2 for any d, [S]. Similar, 
but considerably simpler matching problems were fully solved in [HPPS]; we will go into 
the details later in this introduction. Our question is related to two intensively studied 
families of problems. First, invariant measurable functions (factors) of point processes 
have been of interest from a statistical point of view (e.g. Palm processes, allocations; see 
[Th], [HL] and further references therein), from the optimization aspect (such as minimal 
spanning trees, [Ale]), and from a more general interest about how much information can 
be extracted from a point process ([HP1] being a seminal paper in the area). Close relatives 
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to the matching questions treated here are the so called allocation questions, when one 
has to assign disjoint sets of measure 1 to every point of a Poisson point process of intensity 
1 so that this partition of R d is a factor. We will return to this later. The other related 
branch of problems, from a different direction, is finding an optimal matching between 
independently distributed points in a unit cube. While this latter field is almost fully 
explored (as a result of work by Ajtai-Komlos-Tusnady, Talagrand, Shor, Yukich; see [Y], 
[Ta] for surveys and a still standing challanging problem by Talagrand) , the methods there 
do not seem to apply for our setting, because of the difficulties arising from the infinite 
setting and invariance. 

While the matching rule we are looking for has to be a deterministic function of the 
configuration, one may relax this requirement and allow extra randomness. The additional 
freedom we gain this way is considerable. An example is that using extra randomness 
allows one to partition R d (Z d ) to "nice" subsets (e.g. cubes) with independent labels 
in it: simply take kZ d + v with v G [0, k] d chosen uniformly. (This partition cannot be 
defined as a factor, by ergodicity.) A partition with these properties enables one to use 
local matching rules, repeated for countably many, coarser and coarser partitions. 

A variant setting to our problem is when, instead of matching points of two colors, we 
have one Poisson point process, and want to find a perfect matching of optimal tail on its 
points. (For the Z d case the analogous problem is meaningless.) Call this a 1-matching 
problem to distinguish from the 2-matching problem defined earlier. Similarly to the 
matching question in a finite box, [Y], the 1-matching problem is much simpler than the 
2-color case. The main reason for this is that much of the difficulties in the 2-color problem 
is coming from the difference between the number of vertices of the two colors within some 
given box. This discrepancy is around the square root of the number of points in the box, 
and it gives a lower bound to the number of points that cannot potentially be matched 
within the box. Since these points have to find their pairs beyond the boundary of the 
box, isoperimetry starts playing a role, and this is responsible for the dramatic change 
between dimensions < 2 and > 3, as seen in Theorem 1.1 and Theorem 1.2. Obviously, 
the difficulty coming from discrepancies does not arise in the 1-color case. 

Let us summarize briefly, what has been known about the four problems given by 1 
and 2-color matchings, with randomized or deterministic matching schemes. See [HPPS] 
for a detailed account and an instructive table. The randomized 1-color matching has a 
sharp tail of order exp(— cr d ) for all dimensions. Similarly, for the randomized 2-color 
problem [HPPS] obtained a sharp < cr~ d l 2 decay for d = 1, 2, and < exp(— cr d ) decay for 
d > 3. For the 1-color factor matching, the tight bounds are of order < cr~ l for d = 1, 
and < exp(— cr d ) for d > 2. For the 2-color factor matching, the best known upper bound 
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was r~ s ^ with s(d) < 2/(1 + 4/d), [S]. 

Let us point at another interesting phenomenon about the decay rates of various 
problems. First, there is a big gap between the optimal decay rates of the randomized and 
the deterministic 1-color matching problems in dimension 1. Since the distance between 
two neighboring configuration points of a Poisson point process in 1 dimension follows an 
exponential distribution, it is rather the slow, linear decay for the factor matching that is 
surprising. It sheds some light on how restrictive the requirement of giving a deterministic 
perfect matching is. 

Although Theorem 1.2 is a big improvement to earlier results, the degree of the optimal 
rate of decay is still open. A trivial lower bound is the following: 

Lemma 1.5. For any 2-matching scheme on 7h d 

E[exp(cr d )] = oo, (1.6) 

where r is the distance between the origin and its pair, and c is some positive constant. 

Similar statement holds for the Poisson mentioned before. For a proof observe 

that the distance of the origin to the closest point of opposite color is a lower bound for r, 
and this already satisfies (1.6). 

Though allocation questions have a flavor similar to matching questions, we do not 
know of any direct connection that would make them equivalent in some sense. If ui and 
u>2 are the configurations of two independent Poisson point processes of intensity 1 (call 
them yellow and blue points), and Ai is a deterministic, invariant allocation rule (i = 1,2), 
so that Ai(x) is a set of measure 1 for each x G Ui, and these partition R d for each i, then 
define a bipartite graph V = (V, E); V = u\ U ui2- Namely, let x\ and xi be adjacent if 
X\ G Ui, X2 G Cc>2? and A(x\) D A{x2) ^ 0- Then Konig's theorem (generalized to locally 
finite infinite graphs) implies that there is a perfect matching in V. Hence, as observed 
by Holroyd and Peres, if one could define a perfect matching for Y in an invariant way, 
then one would get a matching rule between uj\ and U2 that has essentially the same tail 
behavior as the allocation rule (at least for tails that decay relatively fast). Hence, the 
existence of an invariant perfect matching could be used to give a perfect matching from an 
allocation rule. The best known allocation rule so far is the so-called gravitation allocation, 
[CPPR]. Although the matching scheme presented in the present paper has a better tail 
than what is proved for the gravitation allocation rule in [CPPR], the following question 
is still of interest. 

Conjecture 1.7. Let G = (V,E) be a random locally finite graph with V C R d , and such 
that for any isometry (translation) g ofM. d , (gV,gE) has the same distribution as (V,E). 
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Suppose that there is a perfect matching in G almost always. Then there is also a perfect 
matching that is a deterministic equivariant function of G. 

The conjecture has a similar flavor to one asked by Bowen and Lyons, whether ev- 
ery quasi-transitive planar graph has a periodic 4-coloring, or a question of Lyons and 
Schramm, whether every infinite quasi-transitive graph has an invariant random coloring 
with as many colors as its cromatic number. 

While Conjecture 1.7 leaves the question of creating matching schemes from allocation 
schemes open, the current proof for the matching question provided us with a tool for the 
allocation question. In joint work with Ander Holroyd, we are planning to apply the 
sequence of partitions in Theorem 1.4 to create an allocation rule using the technique in 
[AKT], which we believe may have a tight exp(— cr d ) tail. 

A standard tool for the study of invariant processes is called Mass Transport Principle 
(MTP); see [HPPS] for a version that is close to our setting (Lemma 8), and also for further 
references. We will use the MTP via two of its straightforward corollaries, which we state 
separately: 

Lemma 1.8. Suppose there is a given e > 0, aV invariant partition ofW 1 (Z d ) to mea- 
surable sets, and an invariant measurable subset S of R. d (Z d ). If every C E V satisfies 
\C n S\/\C\ < e, then for any point o ofR d (Z d ) one has P[o G S] < e. 

Lemma 1.9. Let S be some random measurable subset ofM d (Z d ), and M be an invariant 
perfect matching on the points of u. Then P[o or M(o) is in S] < 2P[o is in S]. 

In Section 2 we present a sequence of partitions as in Theorem 1.3. In Section 3 we 
prove Theorem 1.2. Namely, we present a matching algorithm for Bernoulli labelled points 
in some fixed cube and then this is used to give the desired invariant perfect matching on 
Z d . We will repeatedly apply the algorithm for bigger and bigger cubes coming from the 
<2 a (j), using the matching algorithm in a way that we only rematch vertices of smaller and 
smaller density. 

Section 2 and Section 3 are independent, except for that Section 3 uses Theorem 1.3. 
Similar ternms and notation may have different definitions in the two sections. 

In the rest of the paper c and c' always denote positive constants depending only on 
<i, and their values may change from line to line. 
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§2. The sequence of partitions. 

In this section we construct the sequence of partitions of Theorem 1.3. Since we achieve 
this using consequtive sequences of partitions, a look at the summary of Remark 2.11 may 
fascilitate the reader. 

We shall think of Z d as embedded in R d . We shall define Voronoi tessallations of 
IR d , and then other partitions based on that, always using some subset of Z d , chosen as 
a deterministic equivariant function of the labelling. In all these cases there is also an 
inherited partition for Z d , defined by the cells of the tessallation. The reason we prefer to 
partition R d is because it sheds light on how the proof works for Poisson point processes, 
and also because some geometric arguments are simpler to phrase this way. On the other 
hand it is clear that if we construct the desired sequence of partitions for M d , that gives 
rise to a partition for Z d as in Theorem 1.3. For any subset A of IR d , we say that A is 
measurable, if it is Lebesgue measurable. Denote by \A\ its Lebesgue measure. On Z d the 
ex-algebra that we consider is the discrete one, and \A\ stands, as usual, for the number of 
elements in A. If V is some partition, then the sets that it consists of are called the classes 
or cells of V. When o is a point of Z d , the cell that contains o is denoted by V(o). 

We say that a partition P is a refinement of partition Q, if any two elements in the 
same class of P are also in the same class of Q. If P is a refinement of Q, then Q is a 
coarsening of P. By the union of partitions Pi we mean their coarsest common refinement, 
that is, the partition where two elements are in the same class if and only if they are in 
the same class for each Pi. We denote this partition by VPj. Finally, if V is a partition, Q 
is some set of pairwise disjoint subsets of Z d , then the finest common coarsening of V and 
Q is the partition defined by the equivalence relation where two elements are equivalent if 
they belong to the same class of V, or the same set in Q. 

The proof of the next lemma is straightforward. 

Lemma 2.1. If V[ is a sequence of coarser and coarser partitions of 7h d , Q is a set of 
pairwise disjoint subsets ofZ d , and Vi is the finest common coarsening of Q and V[, then 
Vi is a sequence of coarser and coarser partitions. 

In this section balls are understood in the infinity norm (except for two places, where 
we refer to the "usual" ball as the norm-2 ball). That is, the ball B(x,r) of radius r 
around a point x = (x\ , . . . , xj) is {(yi, . . . , yd) £ ^ d '■ maxj|xi — yi\ < r}. Similarly, 
the r-neighborhood of a set H C R d is the union of all the balls of radius r around 
some point of H. Hence the terms "cube" and "ball" stand for the same objects, unless 
otherwise mentioned. 

We denote by Vol(A) the volume of a d dimensional polyhedron A, and by VoLj_i(cL4) 
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the surface area of A. By a path in a graph we always understand a simple path, that is, 
a (finite) sequence of vertices with no repetitions, such that any two consecutive ones are 
connected by an edge. 

If H is some discrete subset of Z d , let V(H) be the Voronoi tessallation of R d deter- 
mined by H. Given a cell of some Voronoi tessallation V(H), we call the (unique) element 
of H in the cell the centre of the cell. 

Denote by Sk the set of points x G Z d with the property that any vertex y such that 
\\x — y\\oo = i,iE:{0,l,..., k}, satisfies ou(y) = (—1)*. Given x G Z d , call the configuration 
on the set {y : \ \x — y\ < k} a /c-bulb of x, if u(y) = ( — l) 1 whenever \\x — y\\oo = h 
i G {0,1, ... ,k}, i.e. if a; G Sk- Thus Sk is defined as an equivariant function of the random 
labelling. A simple but important consequence of the definition is that Li^ =k S n = Sk is 
"sparse": the probability of being in Sk is 2~ ckd . Another reason for the choice of Sk is 
that any two elements of Sk have distance at least 2k from each other, because the /c-bulbs 
of two elements in Sk can intersect only in their boundaries. Hence in V(Sk) every Voronoi 
cell contains a norm-2 ball of radius k. The most important property of /c-bulbs is stated 
in the next lemma. 

Lemma 2.2. Let t G Z+, o be a vertex ofZ d . Let B := B{o,t), and for a vertex x E Z d 
i^t ft,x '■= max(2t, dist(x, o)). Call x a t-giant (of the configuration ui), if the configuration 
on B(x,rt, x ) \ B inherited from u> can be extended to B(x,rt, x ) so that we get an rt, x -bulb 
around x. Then 

P[3 t-giant] < Cc - ^ 

with some C,c> 1. 

Proof. Clearly \B(x,r tjX ) \B\> Lgto^li . Hence the following hold: 

E [number of t— giants] < 2- |B(x ' rt -- ) \ B| < 

x6Z d 

<| fl | 2 -|fl(x,2t)|/2 + J- c 1 z d - 1 2- m ^ li <(2t) d 2- 2d ~ ltd + c x i d ~ x c-/ <Cc~ td , 

i=t+l i=t+l 

with some constants C and c\, c^, c > 1. By Markov's inequality the same upper bound is 
valid for the probability that there exists a t-giant. I 

Given some grid v + rZ d ,v G Z d , r G Z, define the basic cubes of v + rZ d to be the 
cubes that have the form v + {(xi, . . . , xa) G R d : rai < Xi < r(a^ + 1)} with some integers 
at. 

For a C C Z d , denote by d p C the p- neighborhood of the boundary dC of C. Fix a 
point o G Z d . 
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Lemma 2.3. Let K be a convex polyhedron in M, d , and K' be the union of basic cubes of 
rZ d fully contained in K. If K contains a norm-2 ball of radius R around a point o, then 



Yo\(K \ K') 
Yo\(K) 




cr 



with some constant c depending only on the dimension. 

Proof. Any point z of K \ K' has some point of dK in its r-neighborhood, otherwise any 
r x . . . x r cube containing z is contained in K, in particular z G K' . Hence K \ K' is 
contained in d r K, and 



The last inequality is true by the convexity of K. On the other hand, VoLj-i (dK)R < 
Vol(-ftT), which can be seen by subdividing K to pyramids with apice in o (similarly to [Ti]). 



Given a subcube H = \lf =1 [mi,mi + t] of M d , call hyperfaces of the form {x G K : 
Xi = rrii + t}, with some z, right faces. Let V be a partition of ~R d . Define 9(H) with 
respect to V (V treated as a hidden parameter of 9 for simplicity) to be the union of P-cells 
that are contained in H or intersect only right-faces of H. (See Figure 1 for an example.) 

Lemma 2.4. Let H be a 2 l x . . . x 2 l cube of the above form (i.e., with each 1-face parallel 
to some coordinate axis), i' < i, and V be a partition ofM d to convex cells C that satisfy 
one of the following: 

(i) C is a 2 l x . . . x 2 ! cube with each 1-face parallel to some coordinate axis, 

(ii) C has diameter at most 1. 

Then 9(H) satisfies \d9(H)\ < c\dH\ = c'2^ d_1 ) with constants c,c' depending only 
on the dimension. 

Proof. Note that 89(H) is contained in the boundary of some P-cells each of which inter- 
sects dH. These cells are either 2 1 ' x . . . x 2 1 ' cubes (call their set £1), or cells of diameter 
< 1 (call their set S 2 ). Let X £ Si and S(X) = 5 be the minimal number such that 
X intersects some 5 dimensional face of H. Then \X D H\ > 2 1 ' 5 , because the edges of 
X are parallel to the coordinate axes. Since the elements of Si are disjoint, we get that 
I Si I ^ E^J |^xnumber of 5 dimensional hyperfaces of H. That is, < c 2(*- l ')(d-i) 
with some c depending only on d. We obtain: 



Vo\(K \ K') < Yo\(K n d r K) < rVol d _i(dK). 



Putting the two inequalities together gives the statement. 



I 



\39(H)\ < 2d2^ d ~ 1) \Si\ + I U Xe s 2 dX\ < c'^'^h^- 1 '^-^ + | U Xe s 2 OX 
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For the second term here, we can use the crude upper bound \dX\ < 2 d for X E S 2 , and 
\S 2 \ < \dH\. Hence we have 

\dO(H)\ < c2^ d ~ 1) + 2 d \dH\ < c\dH\ = d2^ d ~ l \ (2.5) 

using that H is a 2 l x . . . x 2 % cube. I 
Fix sequences {c^} and {bi} to be bi : = 2 2 * and ai := 2 bi . 

Consider the refinement V k of V(S ak ) where we partition each cell C of V(S ak ) with 
center v(C) using the basic cubes of v(C) + b\JL d '. That is, two points of M, d are in the 
same class (cell) of Vk-, if they are in the same cell C of V(S ak ) and the same basic cube 
of v{C) + bk7h d . By Lemma 2.3 and Lemma 1.8, we obtain: 

Lemma 2.6. The probability that o is not in a bk x . . . x bk cell ofVk is at most P[o is in 
U KeVk d bk K] < cb k /a k . 

Next, let Kj = ^T=j V i- We win P rove that: 

Lemma 2.7. The probability that o is not in a bj x . . . x bj cube oflZ'j is < YlT=j c ^k/dk + 
YZ J+ iCob 3 /b l <c2-*. 

There are two possible reasons for TZ'a (o) not to be a bj x . . . x bj cube: either the 
Vj(o) already fails to be a bj x . . . x bj cube (in this case we say that o is 6j-bad), or Vj(o) 
is intersected by some cell boundary of Vi, i > j. The bounds for these two are provided 
by Lemma 2.6 and Lemma 2.8, and hence Lemma 2.7 will follow. 

Lemma 2.8. Suppose k e Z + , p > 0. The probability that the ball of radius p around o is 
intersected by some cell-boundary from Vk is at most cop/bk, where cq is some constant 
independent of p and k. 

Proof It is clear that for any b k x . . . x b k cell C in V k , Vo\{d p C)/Vo\{C) < d ' pbfr 1 /b d k = 
c'p/bk with some constant c', and thus Lemma 1.8 shows that the probability that B{o, p) 
is intersected by some cell boundary is at most P[o is 6 fc -bad] + c'p/bk < c/2 H + c'p/b k < 
cop/b k with some constant cq, also using Lemma 2.6. I 

Proof of Lemma 2.7. By Lemma 2.8 and Lemma 2.6, the probability that o is in a cell of 
VJ i that does not coincide with a bi x . . . x bi cell of Vi is at most P[o is 6i-bad] + P[the 
Pi-cell of o is intersected by the boundary of some Pj-cell, j > i] < cbi/ai + J^JLi+i c 'bi/bj, 
with some constants c, d . I 

For any j, call the cells of IZ'j that are not bj x . . . x bj cubes irregular cells. Note 
that by Lemma 2.7, the probability that o is contained in an irregular cell of some lZ' k , 
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k > j, is < c'2~ 2 . Hence every irregular cell (in any of the TZ'j) is contained in some 
maximal irregular cell of some TZ' k (k > j), and the probability that this k is greater than 
some k is < c2~ 2K . Let X be the set of maximal irregular cells. We mention that X is not 
necessarily a partition of M d , but a set of pairwise disjoint subsets of it. Let TZ" be the 
common coarsening of TZ'j, and X. By Lemma 2.1, TZ" is still a sequence of coarser and 
coarser partitions. 

Let IZj be a refinement of TZ", to be defined as follows. For each cell C in X, we 
subdivide C by a Z d grid (placed on C in some deterministic way, say with origin on an 
extremal point for some fixed hyperplane). Now replace every cell C G In TZ" by this 
refinement. The other cells of TZ" (those that are not cells of X) are unchanged. Of course 
we still have a sequence of coarser and coarser partitions, and Lemma 2.7 remains valid 
for the resulting TZj : 

Lemma 2.9. The probability that o is not in a bj x . . . x bj cube of TZj is < Yl^j c ^k/ a k + 
Yl^Lj+i c obj/bi < c2~ 2 . In this case, o is in a cell that is contained in a 1 x ... x 1 cube. 

An important fact is that the partition IZj is completely determined by the elements 
of the SiS with i = aj, Oj+i, . . ., and that by Lemma 2.9 a cell of IZj is either a b j x . . . x b j 
cube (in which case we call it a good cell), or a cell of diameter < d~ x l 2 . The IZj satisfy 
the claim of Theorem 1.3 (as we show at the end of this section), except for that the sizes 
of the typical cubes grow fast (and not just double) as we increase j one by one. 

Given a subcube H of M d , recall the definition of right-faces and 0(H) (with respect 
to some given partition of Z d ) from before Lemma 2.4. Now we are ready to define the 
final sequence Qj of partitions, as in Theorem 1.3. The sequence {Qj} will be such that 
{TZi} is a subsequence of {Qj}- 

It is enough to define the "intermediate" partitions between TZi and TZi + i, for any i. 
Note that the cubic TZi-ce\\s do not necessarily subdivide C like a cubic grid, as illustrated 
by the left side of Figure 1. For each good cell C of TZ i+ i (that is, a b i+ i x . . . x b i+ i cube), 
and for £ = 0, 1, . . . , 2 l — 1, consider the subdivision Hi of C to dyadic cubes of size bi2 e . 
Now, for each K e Hi, consider 9(K) with respect to the partition TZi. Define Q\ as the 
set of 9(K)'s as K e Hg. (See the right side of Figure 1.) 

Lemma 2.10. The cell C Q of o in Q\ is a bi2 -pseudocube whenever o is in a good cell of 
TZi+i, unless I = 1 and o is in the bi-neighborhood of the right boundary of its TZi+\-cell. 
This exceptional event has probability < bi/bi + i = 2~ 2 . 

Proof. For £ = the claim is true even with cubes instead of pseudocubes. The volume 
condition for a pseudocube is clear from the construction and Lemma 2.3. To verify the 
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isoperimetry condition, let H e Hi be the cube that we used to define C Q = 0(H), and 
apply Lemma 2.4 together with the volume condition. I 

Finally, let (Qj) be the sequence resulting from the finite sequences (Qj)i when we 
put them one after the other as % = 1, 2, . . . 



i I I i I i I i I i I i I i I i I i eb I I I II i i i i i i i i i i i i i i i i i i i m i i ii 

To the left: The subpartition of a good cell C of TZi+i by the cells of TZ{. To the right: 
the partition He of C by dyadic cubes (dashed), and the 0(K) (thick lines), K e Hg. 

Figure 1. 



To finish the proof of Theorem 1.3, we shall show that for a fixed o € Z d the labels of 
the vertices in the Qj-cell of o are i.i.d. conditioned on an event of probability tending to 
1 rapidly with j. 

Before going into that, let us remind ourselves to the construction of Qj. See Figure 2a 
and 2b for schematic pictures of the sequentially constructed partitions and X. Note that 
we changed the scales for the figure (and this is the cause of seemingly many noncubic cells, 
which is not the case when one uses the the proper parameters as in our construction). 

REMARK 2.11. Summary of the construction of the partitions: 

(1) Sk consists of the vertices that have /c-bulbs around them. 

(2) Vk is a subpartition of the Voronoi tessallation V(S ak ) on S ak to bu x . . . x bu cubes 
(with the exception of a small proportion of the cells). 
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(3) TZ' k is the common refinement of the sequence of Pi's, with index i starting from k. 
This way most of the cells in TZ' k coincided with the cubes in on the other hand 
the TZ' k is a sequence of coarser and coarser partitions. 

(4) I is the set of cells C that are not bk x . . . x bk cubes, with C E TZ' k , and maximal 
with this property. TZ k is the common coarsening of X and 1Z' k . 

(5) We defined TZk from TZ k by subdividing its non-cubic cells (and possibly some others) 
to small chunks whose diameters are uniformly bounded. We still have a sequence of 
coarser and coarser partitions, and most of the cells are still bk by bk cubes. 

(6) Finally, Qj is a sequence that we obtained from TZ k by putting "intermediate" par- 
titions in the sequence (TZk) so that a "typical cell" is a pseudocube of size always 
doubling as j increases by 1. The subsequence TZk provides the one given in (iii). 

Let us point out that the constructions of the partitions Vj,TZj,TZ'j and Qj did not 
use any information besides that coming from Sj, Sj+i, . . .: 

Lemma 2.12. Every partition Qj is a deterministic function ofSj. 

By Lemma 2.10 we know that Qj satisfies (i) in Theorem 1.3. 

To show (ii) in Theorem 1.3, fix j and let i be such that bi < 2 J < that is, 

2 i < j < 2 i+1 . Let B : = B(o,2b i+1 ). Note that the cell of o in Qj is contained in B. 
Further, ai is such that one was using only elements of S ai (and hence possibly elements 
of S n with n > a,i) to define TZi (and thus Qj). Call a vertex x in Z d a giant, if it is an 
a^-giant, as defined in Lemma 2.2 (t = aj). Note that by definition, the existence of giants 
is independent of the configuration within B. 

Our key observation is that if Z d contains no giants, then no element x of S ax can 
be so close to o as that the largest bulb around x intersects B. That is, if there are 
no giants, one can tell the elements of S ai , S ai+1 , ■ ■ ■ without looking into B. Hence the 
configuration outside B determines the cell C of o in TZi+i, together with the subpartition 
of C by TZi - and these two determine the cell of o in Qj. (Here we are using Lemma 2.12.) 
Then, conditioned on this event (no giants), the vertices in the cell of o in Qj have i.i.d. 
labels, since the cell of o is contained in B. Now, by Lemma 2.2, there are no giants with 
probability > 1 — Cc~ a t . Furthermore, the probability that the cell of o in Qj is not a 
2 J x . . . x 2 J pseudocube is at most the probability that it is not in a bi x ... x bi cube of 
TZi plus the probability that it is at the right boundary of its 7^-cell, as in Lemma 2.10. 
By Lemma 2.9 and Lemma 2.10, this is bounded by c2" 2 ' + c'b' 1 < c"2" 21 < c"2 -2a_1 . 

Define Aj to be the event that there are no giants, and the cell of o is a pseudocube. 
We have just seen that P[Aj] > 1 — c2~ 2J \ and this finishes the proof of (i) and (ii) in 
Theorem 1.3. Part (iii) follows by setting Q a ^) := TZk- 
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Four points of Sk, their 
neighborhood, and V(Sk) 





1Z' k (thicker lines indicate boundaries 
ofVj,j = k+l,...) 



T (the grey regions are not 
covered by any set of J) 



Figure 2a. Construction of the sequence of partitions 
(on a scale altered from the real one) 
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TZk, resulting from IZ'l (the finest common coarsening of X and 1Z' k ) 
by subdividing irregular cells to cells of small diameters 



Figure 2b. 



§3. The matching rule. 

Recall that an n-pseudocube or a pseudocube of size n in Z d is a subset C that contains 
some n/2 x ... x n/2 cube and is contained in some 2n x . . . x 2n cube, and satisfies the 
isoperimetry condition. Call the elements of U;<2j dyadic pseudocubes. The reason 
for the name is that by Theorem 1.3, the pseudocube of Qi is about twice the size of 
the pseudocube in Qi-i, and these partitions are coarser and coarser, so most of the 
pseudocubes in Qi are subdivided by pseudocubes of Qi-i in a dyadic pattern. 

For some subset S of Z d , denote by y(S) the set of yellow elements of S, and by b(S) 
the set of blue elements of S. 

Given some C C Z d , with Bernoulli (1/2) labels on it, let the surplus of C be \b(C) — 
y(C)\. Denote this quantity by sur(C). 

Note the distinction between subgraphs of a graph G and graphs on the vertex set of 
G. Also, there is a bit of ambiguity about the use of the word edge : sometimes it refers 
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to edges of Z d , and sometimes to pairs in the matching, but it is always clear from the 
context. 

From now on fix function f : W + i— > R + to be f(x) = x 3~^~ . Say that a pseudocube 
C is bad, if sur(C) > /(|C|). 

Before presenting the matching rule, let us prove a few simple lemmas. 

Lemma 3.1. Let A c M, d be connected, and U C M, d with mm Xjye u\x — y\ > s. Then if 
diam(A) < s, then \U C\ A| < 1. Otherwise 

\AnU\ < c|aA|diam(A)s- d . 

Proof. If diam(A) < s, then the statement trivially holds by the assumption on U . So 
suppose diam(A) > s. Let A s be the s-neighborhood ("fattening") of A. We have |A S | < 
c|<9A|diam(A). Since the balls of radius s/2 around points of A fl U are disjoint, and they 
are all contained in A s , we have |A fl U\ < c'|A s |s~ d < c|9A|diam(A)s~ d . I 

Lemma 3.2. Let S C R d be a cube, S' c S, \S'\ > \S\/2. Then there exists an S" c S' 

d— 1 — e/2 l+e/2 

wit/i |S"'| = /(l^l) = |>S| 3 /2 suc/i that dist(x,y) > c|5| d2 /or every x^y G S"', roitt 
c depending only on d. 

l + e/2 

Proof. Pick elements of 5" for 5"' one by one, always removing points of the c\S\ ^ . 
neighborhood of the chosen point from S (and picking next elements from what remains). 

I 

Fix a k, to be determined later. We will choose it large enough, and a power of 2 for 
technical convenience. Note that in what follows, c and d will always denote constants 
that do not dependent on k. Define a(0) := log A;, and 6(1) := 1. 

Choice of a(i) and b(i): Fix b(i) > 2* d , i = 2, 3, . . ., to be a subsequence of a(i) (where 
a(i) is as defined in Theorem 1.3). Fix an increasing subsequence a(i) of a(i), i = 1, 2, . . ., 
such that a(i) > max{6(z) 2d/e , exp(exp(z d " 2 ))}, and such that Y.7=a(i)/d( k2U )~ e/2 ^ 
\k^b(i + l)~ d . 

The next lemma for later use is of elementary geometry. The claim is intuitively clear, 
but not trivial to prove. We do not need the specific value of the constant, but ca = 2 d+2 
works. Recall, that for any subset S of Z d 1 we denote by dS the set of vertices with degree 
< 2d in S. In the next lemma, if E is some set of edges and V is some set of vertices, let 
Ey stand for the subset of edges E incident to some element of V. 

Lemma 3.3. There exists a constant ca, depending only on d, such that the following holds. 
Let T be some subset of the edges of an m x . . . x m cube K , and let Ai, ... , A^ be the 
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connected components of K\T. Then for all but at most one of the j 's we have 



\dAj\ < c d \T dAj \. 



Consequently, there is a Aj such that 

J^ ldA.l < 2c d \T\. 

Proof. Let i be such that |<9Aj fl dK\ is maximal (in case of ambiguity, decide arbitrarily). 
Take any j ^ i. For x G dK and z G K, define P(x, z) as a path from x to z that 
makes as few turns as possible (otherwise its choice is arbitrary). Regard P(x,y) as some 
element of the vector space 15t^( K \ where ~F?(K) is some arbitrary fixed orientation of 
the edges of K. For each element (x, y) of (dAj n dK) x (dK \ dAj), let P x , y G 

be ]W\T,zeK( p ( x ' z ) ~ p ( z ,y))- Finally, define a flow /i as Ex, y p^M^^.y where 
the sum is over (x,y) G (dAj fl 9K) x (9K \ dAj). It is not hard to see that the flow 
through any edge is < ^ Cd ~ 1 ^ (1 + ) < (c^ — 1) with some constant c^, where the 

second inequality follows from j ^ i. (A proof for the 2-dimensional case, can be found 
in [A], Lemma 9. The only additional thing needed, is an upper bound of order \K\ for 
the number of paths P(x,z), (x,z) G dK x K containing an arbitrary edge e. Now, if 
e = {(vi, . . . ,v v , . . . , Vd), (vi, . . . , v v + 1, . . . , I'd)}, then one of x and z has j'th coordinate 
equal to Vj for every j ^ z/, which gives an order choices for them, but also x has 

to be on the boundary, which makes the number of choices be of order \K\. ) The total 
strength of the flow is \dAj fl dK\. Since is a cutset between the sources and the 

sinks of the flow, the total amount flowing through it is equal to the strength. Putting 
this together with the bound on the amount flowing through an edge, we obtain: 

\dAjndK\ < (c d -i)|r 9A ,|. 

Hence | dAj \ < \ (dAj n dK) U T dAj \<c d \T dAj \. I 

The following lemma of several later uses is Chernoff's bound about independent 
Bernoulli sums, in the language of surpluses. 

Lemma 3.4. The probability that a fixed pseudocube K of size k2 l has surplus > f((k2 l ) d ) 
is < eM-f((k2 t - 1 ) d ) 2 /2(k2 l+1 ) d ). 

Proof. Chenoff's bound provides an estimate < exp( ~^|^^ ) for the probability. Then 
use the fact (k2 l ~ 1 ) d < \K\ < (k2 l+1 ) d ). I 



17 



Definition of the boundary of a partition: Given some partition V of the vertices 
of some graph in Z d , we denote by dV the subgraph induced by the union of the (inner) 
vertex boundaries of the cells ofV. 

Definition of V: Consider Qi from Theorem 1.3. For a C G Qi, let the cells of 
Qi-i in C be called the bricks of C. Let the cells of <2i og A; = Q a (o) be called basic 
pseudocubes. For j > logfc, say that a cell C G Qj is bad, if sur(C) > /(|C|), or if C 
is not a 2 J -pseudo cube. This definition implicitly relies on j and Qj; however, one can 
extend the definition of "bad" to any element C of UQj for the following reason. Since 
the Qj are coarser and coarser partitions, one can trace back for each C G U™ log k Qj the 
smallest j such that C G Qj, and say that C is bad, if it is bad in Qj. 

Call C ripe, if C is not bad, but one of its bricks is bad, and further, C is maximal 
among pseudocubes of UjQj with this property with respect to inclusion. That in fact 
there is a maximal such C containing o for every o, follows from the next lemma. 

Lemma 3.5. The probability that Qi+\ og k(o) is not bad, but one of its bricks is bad is at 
most cexp(-c'(/c2*) d ~ 2 ~ e ). 

Proof. For x G C, the probability that Qi-i+\ oe k(x) is bad is bounded by cexp(— cf (k2 l ) d ~ 2 ~ e ) . 
This is a consequence of Lemma 3.4 (in case Qi-i+\ gk(x) is bad because it has too large 
surplus), and Theorem 1.3 (in case Qi-i+\ gk(x) is bad because it is not a pseudocube). 
Hence 

P[Qi_i_i og fc(o) is not bad but some brick of it is bad] < 

< P[Qi- 1+ i ogk (x) is bad for some x G -B fc2 *+ 2 ] < cexp(-c'(/c2 J ) d ~ 2 ~ e ) 
if k is large enough, for every i G Z + . I 

A bound of the same magnitude holds for the probability of being in a ripe pseudocube 
as for the probability of being in a bad pseudocube of similar size, as stated in Lemma 3.6. 

Call a pseudocube C an elementary pseudocube if it is ripe or it is a basic pseu- 
docube that is not contained in any ripe pseudocube. It's a consequence of the definitions 
that elementary pseudocubes in UjQj partition Z d ; call this partition V. An important 
property of V is that none of its cells C G Qj is bad in Qj. Lemma 3.6 gives an exponential 
bound on the tail probability of the diameter of the P-cell of a point. In fact, the reason 
we defined pseudocubes, and the partitions of always doubling approximate cell sizes in 
Theorem 1.3 (instead of just taking the more convenient sequence Q a (k))i is to have this 
control of the tail coming from Lemma 3.4 (which is inherited from the tail for bad cubes). 
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Lemma 3.6. The probability that a vertex is in a ripe pseudocube of size k2 % is at most 
cexp(— c' (k2 l ) d ~ 2 ~ e ) . Hence, the probability that the V-cell of a point has size k2 l has the 
same bound. 

Proof. By Lemma 3.4 and the fact that a ripe pseudocube of size k2 l contains a bad 
pseudocube of size k2 l ~ x . I 

Definition of Vf. If we find a maximal matching within each "P-cell, the set of 
unmatched points is "relatively small" . We will then match these points, using a sequence 
of coarser and coarser partitions. By definition, V is such that for any cell C of Qi, 
i > log k, C is either contained in some cell of V, or it is a union of some cells of V. Define 
Vi as the common coarsening for Qi and V, i > a(0). (In particular, Vq = V.) Since V is 
a refinement of Vi for any i, every cell of Vi contains some elementary cell. In particular, 
we achieve the following, which was the goal of the last few paragraphs: 

Lemma 3.7. If i > log A; then no cell of Vi is bad. That is, every cell C e Vi is a 2 % - 
pseudocube with some i' > i, and sur(C) < f(\C\). 

Call a cell C G Vj cubic, if it is a 2 J -cube, otherwise it is non-cubic. If j = a(i) for 
some i, then every cell of Qj that is not bad is cubic, hence Vj(o) can be non-cubic only 
if it is a cell in V \ Qj, and hence Vj(o) has size > 2 J . This probability is bounded by 
Lemma 3.6. These two give us that with probability exponential in 2^ d ~ 2 ~ 6 \ a point is 
in a 2 J -cubic cell of C E Vj = V a (i). 

For each % we will define an invariant monochromatic subset Ui of Z d and a matching 
Mi in such a way that \Ui D C\ = sur(C) for every C £ V a ^, and every vertex of C \ Ui 
is matched by Mi. 

For every 2 a W x . . . x 2 a W cube S in V a (i) , if the majority of points in S is blue, choose 
S' C S to be the set of blue vertices, otherwise let S' be the set of yellow vertices. Choose 
S" from S' as in Lemma 3.2. Finally, pick an arbitrary subset of cardinality sur(S') from 
5"'; call this Ui(S). Now let Ui := L)Ui(S). (Of course the above choices can be made cell 
by cell in some predetermined way, that is the same for every translate of the cell, to make 
the resulting Ui invariant.) Note that 

l + e/2 q(i)(l + e/2) 

mm Xj y eUxn s\x - y\ > c\S\ d 2 =2 a , (3.8) 

by our choice coming from Lemma 3.2. 

Then we have, by Lemma 3.1, the following: 

Lemma 3.9. For any S cubic cell of size 2 a W ofV a ^, and connected subset A of S, either 
|A fl Ui\ < 1, or one has 

|A n Ui\ < c\dA\2 ail) (mm x , yeUt dist(x, y))- d < c\d/\\2~ ea{l)/2 . 
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K e V a{i) K 

White cells represent cells of V . The multiplicity of each edge is the number 

Thick lines show cell boundaries of VhU)- of edges between two small cubes on the left. 

Figure 3. The contraction f3 from K to K. (Scale altered from real. Pseudocubes of 
V and Vui\ are represented by cubes.) 

Now we are ready to present the matching algorithm. 

Step 1. For each elementary pseudocube C (i.e. cell of V) match blue points with 
yellow points as long as it is possible. 

Take the union of these matchings over all elementary pseudocubes C, and call the 
resulting matching A4q. Note that in each elementary pseudocube C the number of points 
not matched by .Mo is equal to sur(C) < f(\C\) (using Lemma 3.7 applied to P a (o)> i- e - 
the fact that an elementary pseudocube is not bad). Hence, at least |C|/3 vertices of each 
color in every elementary pseudocube \C\ are matched by A4q. This, and the bound on the 
number of unmatched points will have importance when we will define pairwise disjoint 
augmenting paths. It is also clear that .Mo is invariantly defined. 

The next lemma is straightforward from Lemma 3.6 and the definition of .Mo- 
Lemma 3.10. The probability that a vertex, given that it is matched by .Mo, has its pair 
at distance > r from it, is < cexp(— c'f(r d ) 2 /r d ) < cexp(— cV d_2_e ). 

Denote by Uq the set of points that are not matched by Aio- 

For sets A and B, the multiset union of them will be denoted by AUB; that is, A(jB 
is the union of A and B, with elements in A fl B having multiplicity 2. If A is a multiset 
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and B is a set, then we define A fl B to be the multiset such that every x that is in B will 
have the same multiplicity in A fl B as in A. 

In Step 2, we proceed in countably many stages. In Stage i, as i = 1, 2, . . ., we define 
a rematching procedure, that will match every point of Z d \ Ui to some other point. In 
particular, it matches the points of \ Ui (which were unmatched before). The idea 
is that in Stage i we redefine only edges that have an endpoint in a set of "P-cells (which 
is "sparse" of density about l/b(i)). The method ensures that the tail of the matching 
remains of the same magnitude as for Mo (provided by Lemma 3.10). The rematching 
will use augmenting paths. When i > 1, in Stage i, the scarcity of Ui-\ (which is a result 
of our choice for the sequence a(i)) makes the rematching simpler. However, in Stage 1, 
the set Uq of points to be matched is coming from Step 1, and this fact is responsible for 
more difficulties and the sharp role of isoperimetry in this case. 

Definition of N{K): For each cubic cell K in V a {i) we will do the following. Fix i 
and a K in P a (i) f° r the rest of this section. Define K by contracting every cell of V to a 
vertex, that is, contracting elementary pseudocubes of K. We do not erase any edge after 
the contraction, multiple edges are allowed. Let $ be the contraction mapping from K to 
K. (See Figure 3.) 

Then define N(K) to be a network on a graph (V, E), with capacities on edges. Here 
V is the union of V(K) and two extra vertices B and Y, and E is the union of E(K) and all 
edges of the form {£?, x} or {Y, x}, x G V(K). Define capacities as follows. Let every edge 
incident to some vertex in K\df3(Vb(i)) have capacity 0. Recall that d/3(Vb(i)) denotes the 
union of the vertex boundaries of the cells in the push- forward partition $(Vb(i))- Let each 
edge induced by V(K) fl d(3(Vb(i)) have capacity k' J=T /12d', where 2d > d! > d is chosen 
so that it makes this number an integer (and at least 1, using that k is large enough). 
Note that for any x G K, J2 x ee ca P( e ) is at most half the number of points in / 9~ 1 (K) 
that are matched by Mo, by the definition of Mo and K, and the properties defining a 
pseudocube. The same will be true if we replace Mo by Mi-i, because the endpoints 
of Mi-i will always contain the endpoints of any previous matching. Finally, for every 
vertex x in (t/jUt/j_i) fl K, choose a "representative" x G V(K) fl d/3(Vb(i)) such that x 
and f3~ 1 x are in the same V^iyceW. For each such x, if x is blue and x G Ui-i or x is 
yellow and x G Ui, add an edge of capacity 1 between B and x, otherwise add an edge of 
capacity 1 between Y and x. Let the other edges on B and Y have capacities 0. Note that 
the total capacity of the edges on B is equal to the total capacity of the edges on Y, by 
the choice of Ui (because b(Ui n K) - y(Ui n K) = b(Ui-i n K) - y{lJi-\ H K)). 

What we are really interested in is the set of vertices incident to some edge of nonzero 
capacity in d/3(Vb(i))V {B, Y}, and hence the network induced by them. Having the bigger 
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network here, with many edges of zero capacity, has the advantage that it is easily related 
to K. This technical convenience will make it easy to use some of our geometric lemmas. 

For anlcK, denote by N B) x and N Y ,x the subnetwork of N(K) induced by BLiX 
and Y Li X respectively. 

We will form a set E(i, K) of pairs from the elements of L/j_iUC/j in K, and a set of 
paths V(i, K) := {P(x, y) : {x, y} G E(i, K)}, in such a way that: 

(i) elements of Ui are paired with elements of the same color in and all the elements 
of Ui-i that are not paired this way, are paired with an element of the opposite color 
in Ui-i. Call the set of such pairs E(i, K). For any pair {x, y} G E(i, K), P(x, y) is a 
path in K n d$(Vb(i)) between x and y (where v is defined from v as in the definition 
of N(K)). 

(ii) Every vertex x G K is contained in at most \f3~ 1 (x)\/?> elements of D^kP. 

Proposition 3.11. // the maximal flow on N(K) from B to Y has a = \ U ^ K ]+\ U ^ K \ 
strength, then there exists a set of paths V(i,K) (with a set E(i,K) for the pairs of end- 
points), that satisfy (i) and (ii). 

If such V(i, K) exists, then there is a set V(i, K) of pairwise vertex- disjoint paths on 
the vertices of K such that for every P G V(i, K), P = (xi, . . . , x m ), there is a P G V(i, K), 
P = (vi, . . . , v m i), such that v\ G Ui-\, and: 

(I) m' = 2m + 1 if and only if v m i G U^; m' = 2m + 2 if and only if v m > G Ui-\; 
(II) v\ and v m i are such that (3v\ = x\ and f3v m ' = x m ; 

(III) for v = 1, . . . ,m, one has v^v G and further, v^ v and v^v+x o,re matched by 

M.i-\. Moreover, the vertex colors along P are alternating. 

Figure 4 illustrates the connection between the elements V(i,K) and V(i,K). 

Proof. If there exists an admissible flow from B to Y of strength \Ej— ir '"- r ! - 



2 

J2 X cap({S, x}) = J2 X ca P({^ x }) = a 7 then there is also an integer valued flow, since 
the constraints on every edge are integers. Such a flow can be decomposed as a sum of 
paths. Delete the first and last edge (the ones incident to B and Y) from each of these 
paths, and define the set of resulting paths to be V(i, K). By the definition of N(K), this 
shows the first assertion, (i) and (ii). 

We can take a preimage in K by (3 for each element P(x, y) in V(i, K), increase them 
by attaching one point of Ui-\ U Ui to each endpoint of the preimage, to find a set of 
pairwise disjoint alternating augmenting paths P(x,y) that satisfy (I)- (III). 

The connection between P(x,y) and P(x 7 y) is simply the following. The two end- 
points of P(x, y) are x and y. The 2j'th point of P(x, y) is going to be a vertex from 
(3~ 1 (xj) that is covered by Mi-i, and chosen to be of a color different from v\. Then 
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the 2j + l'st vertex will be the pair of v by The choices of the V2j are otherwise 

arbitrary, except for that V(i, K) := y) : {x, y} G i?(z, X)} has to consist of pairwise 

disjoint paths. 

We can indeed choose the preimages P(x,y) to be pairwise disjoint and fulfill (III), 
by the choice of the capacity constraints in V(K): every x G V(K) is crossed by at most 
as many paths of V, as the number of edges of Mj_i with a yellow (respectively: blue) 
endpoint in /3 _1 (a;), and so there is a choice when each of these edges is present in at most 
one of the augmenting paths. I 
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Figure 4. A path of V(i,K) in K (left), and the corresponding augmenting 
path of V(i, K) in K (from Figure 3). The shaded region is (3~ 1 d(3Vb(i). 

We shall consider the path P(x, y) as an augmenting path, and replace the edges of 
P(x, y) in M.i-i by those that are not in there. Doing this over all K G V a u) and P(x, y), 
we get M.i- For a particular K G V( a (i)), after doing the "flip" for each augmenting path 
in V, all of K but K D Ui is matched. Moreover, the densities of the different edge-lengths 
still decay exponentially, see Lemma 3.13. 

Let us summarize our conclusion about augmenting paths, also adding a claim that 
the limiting perfect matching exists (which is yet to be shown): 

Proposition 3.12. If the conditions (and hence the conclusions) of Proposition 3.11 hold, 
then Step 2 is successful, and by the end of stage i, all vertices in K\Ui are matched. As 
i —>■ oo, we get a perfect matching in the limit. 
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The following lemma tells that the mathings Mj stabilize (hence proving the last 
assertion of Proposition 3.12), and gives the tail probabilities for the limiting matching. 
For simplicity, in this statement and its proof, by an edge we mean a pair in one of the 
.Mi's. For a vertex x, Mi{x) will denote the pair of x by Mi, or the emptyset if x is not 
matched. By ||e|| we denote the distance between the two endpoints of e. 

Lemma 3.13. Suppose the conditions in Proposition 3.11 are satisfied. Then for every 
vertex v G Z d there is a number j(v) such that v is contained in an edge e with the 
property that e G M.j, for every j > j(v). Moreover, we have 

P[||e|| > r] < cexp(-cV d - 2 - e ). 

Proof. The edges stabilize for the following reason. In order for a vertex x to be in different 
edges or no edge in (Aij)j infinitely many times, it is necessary that either x or Aij(x) is 
contained in \JKev a f 3 )P{3i K)-> an d hence in (i~ x d(iV\ ) {j) U Uj, for infinitely many j's. The 
probability of this is 0, since £ P[x G /3 _1 <9/3:P 6(i) U Uj] + P[Mj(x) G /3 _1 a/3P b(i) U Uj] < 
2 ^2 P[x G (3~ df3Vb{j) U Uj] by Lemma 1.9, and this sum is finite. 

Now, let e be as in the claim, and j(v) be the smallest nonnegative integer such that 
e G A4j whenever j > j(v). If j(v) = 0, then by definition of Aio, the endpoints of e are 
in the same V-ce\\, and the claim follows by Lemma 3.10. 

Otherwise there is some sequence of edges that v is contained in, until it stabilizes from 
M.j(v) on. Let h be the greatest number such that v G Uh if such a Uh exists, otherwise 
h := 0. There is an m, and a sequence i±, . . . , i m , %\ := h+1, such that A4i j (v) ^ M.i _i(v). 
We may assume that m is maximal such. Let the edge containing v in M.i^ be called ej. 
By maximality, e m = e. Since v G U^—i = Uh, by (II) and (III) we have that Aii-i(v) is 
in the p^V^-cell of v. Thus ||ei|| < Vd2 b ^ +1 , unless v is in a V-cell of size > 2 b{ - h \ 
This shows the following tail for ||ei||: 

P[||ei|| > r] < 2V[v G Uj , j > log -^=] +P[V(v) has radius > r/2rf 1/2 ] < 

2yd 

< c2 -«([ lo g-/ 2 v^]) 1+£/2 +CoeX p(- c [ ) r d - 2 - e ) < cexp(-cV d - 2 - e ). (3.14) 
We got the second term on the left of (3.14) by Lemma 3.6, and the first term from 

oo 

P[v eUi, j > logr/2Vd] <c mm X)yeUi dist(x,y)~ d < 

j = [logr/sqrtd] 

oo 

j=[logr/Vd] 



24 



using (3.8). For the last inequality of (3.14), we used the choice of a(i). 

We get each from e^-i by the "switch" along some augmenting path P(x, y), and (by 
the choice of h) we also assumed that they are inner edges of this path, whenever i — 1 > 1. 
Thus, ei and e^-i being consecutive edges in P(x, y), they share one endpoint, while their 
other endpoints (x(ei) and x(ei-i) are in adjacent P-cells). Hence P[||ej|] — ||ej_i|| > 
r] < P[\x(ej) -x(e j - 1 )\ > r] < 4P[V(o) has diameter > r/4] < cexp(-c'(r) d " 2 - e ), by 
Lemma 1.9 and Lemma 3.6. That is, 

P[|M| - ||ei_i|| > r] < cexp(-c'r d - 2 - e ). (3.15) 

Also, for each edge e^, if v switched from edge e^-i to edge ei in some stage j, then in 
particular, one of the endpoints of has to be in (3~ 1 d(3Vb(j)- We can give the following 
rough bound on this probability: 

2P[o G f^Mfi))] = 2P0V b(j) (o) E dp(V b{j) )} < 

< P[diam(P(o)) < b(j) , dist(o, d(V bU) (o)) < b(j)\ + P[diam(P(o) > b(j)] < 

< P[dist(o,d(V b{j) (o))) < b(j)} +P[diam(P(o) > b(j)} < 

< c2~ b ^b{j) + ceM-c'b{j) d - 2 - e ) < cexp(-c / 6(j) d - 2 - e ). (3.16) 

Here the bound on the first probability is a consequence of Lemma 1.8, and the bound on 
the second probability is by Lemma 3.6. 

Let A be the event that m > logR and B be the event that {{||eo|| > R/logR} or 
| |ej| | — | |ej_i| | > R/ logi? for some i e {2, . . . m}}. We finish the proof by noting that for 
||e|| > Rio happen, one has to have at least one of A and B hold. Using (3.16) to bound 
the probability of A (which is < 2P[o G U°i [logr] ^- 1 a / 5(P 6(j) )), and (3.14) and (3.15) to 
show P[B] < clog.Rexp(— c'(R/ log R) d ~ 2 ~ e ) : this gives 

P[e > R] < cex.p(-cb(\\ogR])) + cexp(-c(R/ logR) d - 2 - e )logR. 

We conclude that 

P[e > R] < cexp(-c'R d - 2 - 2e ), 

since b( [log R]) > R d . I 

So, all what is left is to show the existence of an admissible flow in N(K) of strength 
a = l^-d+l^l from B to Y, since then the conditions in Proposition 3.12 and Lemma 3.13 
follow. 
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By the maxflow-mincut theorem, the existence of such a flow follows if we show that 
every mincut has capacity > a. 

Before proving this, let us state a rough estimate relating the size of the boundary 
of a subgraph of K (or K) to the number of dyadic pseudocubes of a certain type in it. 
Given A C K, denote by OA as before, the set of vertices that have degree < 2d in A, and 
denote by djA the j-neighborhood of d A. 

Lemma 3.17. Let A be an induced subgraph of K, and A := ,9 _1 (A) c K. Then, for 
the set Di(A) of dyadic k2 £ -pseudocubes in A that are not contained in any larger dyadic 
pseudocube in A, we have 

\D e (A)\ < 2\dA\d 1 / 2 (k2 l ) 1 - d . 

Proof. Every element of De(A) is contained in And d i/2 k2 £+iA, as can be seen by induction 
on i. The elements of Dg(A) are disjoint. Thus dividing | And d i/2 k2 e+iA\ by the lower bound 
(k2 e ~ 1 ) d on the volume of an element in Dp, we get an upper bound 2d 1 ^ 2 \dA\(k2 e ) 1 ~ d for 
D e (A). Here we were also using \A H d d i /2k2 i+i A\ < \dA\d 1 / 2 k2 e+1 . I 

Proposition 3.18. There is an admissible flow of strength a from B to Y in N(K). 

Proof. We will prove by contradiction. So, suppose that there is some it minimal edge- 
cutset between B and Y, and that cap(7r) < a = ^ x cap({S, x}) = ^ x cap({Y, a;}. 
Hence ir contains some edge of K. Let 7 = 7r n K, and Ci, C 2 , ■ . . , C m be the connected 
components of (K fl d(3Vb(i)) \ 7- One can find a set 7' of edges of capacity, such that 
K \ (7 U 7') has components C(, . . . , C' m , such that that Ci C and 7' minimal with 
this property (i.e., for any e G 7 U 7', K \ (7 U 7') U {e} has less than m components). 
Furthermore, one has: 

| 7 '| < 2 d ^ +1 ^ +1 \ 1 ndf3V h{{) \. (3.19) 
This is true because by definition, every cell C of Vui\ is either a pseudocube of size 

2 b(i) 

in Qb(i)i or an elementary cell from V (which is thus contracted by 0), hence every class 
of f3Vb(i) has cardinality < 2 db ^ +1 . We can assign to each element of 7 fl C, C E (3Vb(i), 
some e G d$Vb(i) in the $Vb(i)-ce\\ of C, to get (3.19). 

Note that the edges in 7' have costs by the definition of N(K). Hence we may 
replace tt by w U 7' and 7 by 7', to have (by (3.19)): 

Lemma 3.20. There exists a minimal cutset it between B and Y in N(K), 7 = K fl n, 
such that the number of components in K \ 7 is the same as the number of components in 
(K n dpP b (i)) \ 1, o,nd further, | 7 | < {2 d ^)+ 1 )+ 1 + 1)| 7 n d(3V m \. 

Now, for each Cj, one of Ny,Cj and Nb,Cj (call it iVj) has to belong to 7r, otherwise 
there is a path from B to F through Cj that avoids 7r. 
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Lemma 3.21. There is a j' e Z such that 

£ W -1 ^-)! < cirsi < C | 7 | < c '2^| 7 n9M(4 

Proof. The first inequality is by Lemma 3.3. The second is simply because after contracting 
by (3 we kept multiple edges. The last, rough inequality is true because of Lemma 3.20. I 

Apply Lemma 3.3 to V := /3 _1 (7) and Aj := /3 _1 (Cj). We may assume that i = 1 
is the (possible) exception in Lemma 3.3 (and equivalently that C\ is the exception in 
Lemma 3.21), and assume by symmetry that N\ = NB,d- Furthermore, we may assume 
that every Aj is such that 

|Aj-nC/i|>l, (3.22) 

otherwise if Aj D Ui = {x}, it is easy to see that we could remove part of dCj (edges of 
capacity /c^ 1 /12d' > 1) from n, and add the edge between x and B or Y, to get a cutset 
of smaller cost than n. Similar argument works if Aj n Ui = 0. 
As mentioned, if it is a minimal cutset, then 

cap(7r) < cap(iV B g), (3.23) 

since N B g is a cutset itself. 

Use notation C(b(i)) := ( c 2 b Wd)-i fc d/(d-i) / '(I2d'), where the first factor and the 
constant c there is coming from Lemma 3.21, while the second factor is the capacity of 
edges in 7 fl d(3Vb(i)- By Lemma 3.21, using that every edge of 7 fl d(3Vb(i) has capacity 
fc<*/(«i-i)/(12d'), we can bound the left hand side of (3.23) as 

m m 

cap(7r) = cap( 7 ) + £cap(ATj) > C(b(i))\-y\ + £cap(iVj) = 
i=i 3=1 

m 

= C(b(i))\-y\ + cap(iVi) + £cap(iV B ,c 3 .) ~ In j= n y , Cj (cap(N B , Cj ) ~ cap(iV y , c .)) > 

m 

> C(b(i))\-y\ + cap(N BjCl ) + £cap(iV B ,c 3 .) - 1n j= n y , Cj sur(A, n U t - 1 ) - |Aj n C^|. 

Note that we were using our assumption on Ci = /3(Ai) for the last inequality. Putting 
this fact together with the last inequality and (3.23), we obtain 

m m 

c&p{N B k ) > C(b(i))\M + X>p(iV B , c ,) ~ E( 1 ^=iv y ,c,sur(Aj n U^) + \Aj n U t \). 
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(We define the empty sum as 0. That corresponds to the case when 7 is not a cutset for 
K.) This implies 

m m 

^|A,-nt/ i | + ^sur(A J nt/ i _i) >C(6(i))|7|. (3.24) 

g(i-l)(l + e/2) 

On the other hand, the minimal distance between elements of Ui-\ is > 2 a by 
the choice of U l - 1 (see (3.8)). Let A(i) := 2 a «( 1+e / 2 )/ d . We have that 

sur(a A(i _i ) A i n Ui-i) < \d A (i-i)&j n C/i_i| < I {a; : B.A(i-o(aO C <9 2 A(i-i) A.,} n < 

< \d2A(i-i)Aj\/A(i - l) d < c\d^\/A{i - If' 1 . (3.25) 

Using notation Di coming from Lemma 3.17, observe that 

A, C Oa^Aj U (uf:l gi A(i-i)) Ux e ^(A,) X). 
Subadditivity sur(# U H') < sm(H) + sm(H') yields 

a(i) 

sur(Aj n C/i_i) < sur(a A(i _i ) A j n E/i_i) + ^ ^ sur(X n U^) < (3.26) 

£=io g (A(i-i))xer> £ (A j ) 

< c\d^\A{i - iy- d + J2 mA^/mY). 

£=log(A(*-l)) 

Here we used (3.25) for the first term, and for the second term we used the fact that dyadic 
pseudocubes are not bad by definition. 

For the first term in (3.24), we get the following bound. In the first inequality we use 
Lemma 3.9 and (3.22), and in the second one we use Lemma 3.3 and the choice of C\. 
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\ A j n Ui\ < C 2~ a ^ e / 2 \ dA j\ < c2- a(l)e / 2 | 7 |. 

3=2 j=2 

Plugging this and (3.26) into (3.24), we obtain 

m a(i) 

c(bd))\i\ < c2-^/ 2 i 7 i + j2(cmM(i - + E mAjMfmr)) < 

3=2 e=io S (A(i-i)) 
< c 2- a ( i )^\ 1 \ + J2\dA j \(cA(i-l) 1 - d + (k2 i ) 1 - d f((k2 i ) d )) < 

3=2 ' £=log(A(i-l)) 

a(i) 

< \~ f \(c + cA(i-l) 1 - d + (k2 e ) 1 - d (k2 i ) d - 1 - e / 2 ). (3.27) 

£=log(A(*-l)) 

The penultimate inequality was a consequence of Lemma 3.17 applied to each Aj, while 
the last line follows from the first two inequalities of Lemma 3.21. 

We conclude, by the definition of / and C(b(i)), and using \og(A(i — 1)) > a(i — l)/d: 
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Proposition 3.28. // n is some cutset between B and Y , 7 = K n tt, and cap(7r) < 
min{cap(iV B cap(A^ y ^)}, then 

k^b(i)~ d < c + C2(- 1+1 /rf)^-i)(i+^) + (£^)" e / 2 , (3.29) 

£=a(i-l)/d 

where c and C are constants that depend only on d. 

The second term on the right is bounded by a constant (independently of k). By the 
definitions of a(i) and b(i), for i > 2, the inequality (3.29) fails, so there is no minimal 
cutset between B and Y different from N B ^ or N Y showing that the desired flow on 
N(co) exists. If % = 1, if k was chosen large enough, then (3.29) fails. This finishes the 
proof of Proposition 3.18. I 

From Proposition 3.18 and Proposition 3.11 we have the existence of V{i,K), and 
conclude by Proposition 3.12 that the rematching procedure succeeds. Combined with 
Lemma 3.13, this proves Theorem 1.2. 

We mention that once the a{i) and b(i) can be chosen to grow with a suitable speed, 
the crucial inequalities above can be made to be true for any i > 1. The major difficulty 
is the start, since a(0) and 6(0) has to be set 1. This is responsible for the isoperimetric 
considerations and "tightness" in the above computations. 

Remark 3.30. Note that f(x) could be chosen any other way so that (3.29) fails, and the 
tail for the matching is given by Lemma 3.10. All one has to ensure is that the sum on the 
right of (3.29) resulting with this new / is finite when summed up to infinity. E.g. f(x) := 
a; - ^ - (log x log log a:) -1 gives the bound P[o is matched to distance > r] < exp( log ~ c ^ g logr ). 
We have chosen the slightly weaker bound to make the formulas easier to follow. 

Acknowledgements. I am grateful for Ander Holroyd and Gabor Pete for helpful dis- 
cussions. 
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